Fast Labeling and Transcription with the Speechalyzer Toolkit

Author

  • Felix Burkhardt
Abstract

We describe a software tool named “Speechalyzer”, which is optimized for processing large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client-server framework in Java and interfaces with software for speech recognition, synthesis, speech classification and quality evaluation. Its main applications are processing training data for speech recognition and classification models and running benchmark tests on speech-to-text, text-to-speech and speech categorization systems.

Similar resources

An investigation of neutron direct damages at energies of 0.1-2 MeV on the DNA molecules with atomic structure deduced using Geant4 toolkit

This study proposes a method to estimate RBE of fast neutrons using Monte Carlo simulations. This approach is based on the combination of an atomic resolution DNA geometrical model and Monte Carlo simulations for tracking particles. Atomic positions were extracted from the Protein Data Bank. The GEANT4 code was used for tracking the secondary particles generated by fast neutrons during their in...

Calculation of Positron Distribution in the Presence of a Uniform Magnetic Field for the Improvement of Positron Emission Tomography (PET) Imaging Using GEANT4 Toolkit

Introduction: Range and diffusion of positron-emitting radiopharmaceuticals are important parameters for image resolution in positron emission tomography (PET). In this study, GEANT4 toolkit was applied to study positron diffusion in soft tissues with and without a magnetic field for six commonly used isotopes in PET imaging including 11C, 13N, 15O, 18F, 68Ga, and 82Rb. Materials and Methods: GEA...

NiuParser: A Chinese Syntactic and Semantic Parsing Toolkit

We present a new toolkit, NiuParser, for Chinese syntactic and semantic analysis. It can handle a wide range of Natural Language Processing (NLP) tasks in Chinese, including word segmentation, part-of-speech tagging, named entity recognition, chunking, constituent parsing, dependency parsing, and semantic role labeling. The NiuParser system runs fast and shows state-of-the-art performance on sever...

A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance

Current state-of-the-art speech synthesizers for domain-independent systems still struggle with the challenge of generating understandable and natural-sounding speech. This is mainly because the pronunciation of words of foreign origin, inflections and compound words often cannot be handled by rules. Furthermore there are too many of these for inclusion in exception dictionaries. We describe an...

Investigation of the direct DNA damages irradiated by protons of different energies using geant4-DNA toolkit

Background: The total yields of direct Single-Strand Breaks (SSBs) and Double-Strand Breaks (DSBs) in proton energies varying from 0.1 to 40 MeV were calculated. While other studies in this field have not used protons with energy less than 0.5 MeV, our results show interesting and complicated behavior of these protons. Materials and Methods: The simulation has been done using the Geant4-DNA too...

Publication date: 2012